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Abstract 

The fundamental diversity-multiplexing tradeoff of the three-node, multi-input, multi-output (MIMO), quasi-static, 
Rayleigh faded, half-duplex relay channel is characterized for an arbitrary number of antennas at each node and in 
which opportunistic scheduling (or dynamic operation) of the relay is allowed, i.e., the relay can switch between 
receive and transmit modes at a channel dependent time. In this most general case, the diversity-multiplexing tradeoff 
is characterized as a solution to a simple, two-variable optimization problem. This problem is then solved in closed 
form for special classes of channels defined by certain restrictions on the numbers of antennas at the three nodes. 
The key mathematical tool developed here that enables the explicit characterization of the diversity-multiplexing 
tradeoff is the joint eigenvalue distribution of three mutually correlated random Wishart matrices. Besides being 
relevant here, this distribution result is interesting in its own right. Previously, without actually characterizing the 
diversity-multiplexing tradeoff, the optimality in this tradeoff metric of the dynamic compress-and-forward (DCF) 
protocol based on the classical compress-and-forward scheme of Cover and El Gamal was shown by Yuksel and 
Erkip. However, this scheme requires global channel state information (CSI) at the relay. In this work, the so-called 
quantize-map and forward (QMF) coding scheme due to Avestimehr et al is adopted as the achievability scheme with 
the added benefit that it achieves optimal tradeoff with only the knowledge of the (channel dependent) switching time 
at the relay node. Moreover, in special classes of the MIMO half-duplex relay channel, the optimal tradeoff is shown 
to be attainable even without this knowledge. Such a result was previously known only for the half-duplex relay 
channel with a single antenna at each node, also via the QMF scheme. More generally, the explicit characterization 
of the tradeoff curve in this work enables the in-depth comparisons herein of full-duplex versus half-duplex relaying 
as well as static versus dynamic relaying, both as a function of the numbers of antennas at the three nodes. 
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I. Introduction 

Cooperative communication techniques can advantageously utilize the fading environment of a wireless network 
to provide better reliability and/or rate [2], [3]. The simplest theoretical abstraction of a cooperative communication 
network is the 3-node relay channel (RC), where the relay node helps the communication between the source and 
destination nodes by forwarding an appropriately processed version of the source message received at the relay 
node to the destination. Moreover, multiple antennas at the three nodes can markedly boost rate and reliability 
performance by allowing for the exploitation of the inherent combined MIMO and cooperative communication 
gains. 




(a) CNi: A mobile set acts as a relay (b) CN2: A smaller base station acts as a (c) CN3: A sensory network with a mo- 
relay bile relay station (MRS) 

Fig. 1: Three Examples of Cooperative Networks. 

MIMO relay channel communications can be considered for various applications. For instance, Fig. 1 depicts 
three different cooperative communication scenarios to which the theory of this work applies. Fig. 1(a) depicts a 
cellular network, denoted CNi, wherein a mobile user (or mobile set (MS)) uses another mobile user as the relay 
station (RS) to communicate its message to and from the base station (BS). This cooperative model was proposed 
in [2]. Fig. 1(b) depicts a scenario where, in a cellular network (denoted CN2), a particular cell area is divided into 
more than one sub-cell and each sub-cell is served by an additional dedicated node (a smaller BS) to provide better 
quality of service. Thus each user in these sub-cells can use this dedicated node to relay their messages to and from 
the BS. The CN2 network is different from CNi in the sense that in it the relay station can host a larger number 
of antennas. It is under consideration to be implemented in LTE-advanced and WiMAX technologies [4] and being 
standardized for broadband wireless access by the IEEE 802.16's relay task group [5] for expanded throughput 
and coverage with deployment of relay stations of complexity and cost lower than that of legacy base stations but 
higher than that of mobile stations [5]. A third example of a cooperative network (denoted CN3) is the sensor 
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network of Fig. 1(c) (cf. [6]), where a more capable mobile relay station (i.e., with more antennas) helps several 
less capable sensor nodes to communicate with each other. It is possible to give other examples, see for instance, 
the application of cooperative communication in ad-hoc networks in [7]. Note that the numbers of antennas at the 
different nodes vary across the applications and also depend on whether the communication is uplink or downlink 
(such as in the CNi and CN2 networks), which points to the importance of studying MIMO relay channels with 
an arbitrary number of antennas at each node. 

The relay has two phases of operation: the listen phase, in which it receives the signal from the source, and 
the transmit phase, in which it transmits some version of the received signal to the transmitter. If the relay can 
simultaneously operate in both phases it is called a full-duplex (FD) relay and the corresponding channel is called 
a full duplex relay channel (FD-RC). Otherwise, if the relay can only operate in one phase at a time it is called 
a half-duplex (HD) relay and the corresponding channel a half duplex relay channel (HD-RC). Due to the large 
difference between the power levels of the transmitted and received signals however, it is difficult, if not impossible, 
to design FD relays cost- and space-efficiently. The focus of this paper is hence on MIMO HD-RCs. 

Cooperative protocols proposed and analyzed for the HD-RC can be divided into different classes. If a protocol 
uses the CSI at the relay to opportunistically decide the switching time - the time at which it switches between the 
listen and transmit phases - it is called a dynamic protocol. Dynamic protocols considered in the literature include 
the dynamic decode-and-forward (DDF) protocol of [8] and the dynamic compress-and-forward (DCF) protocol 
of [9]. Otherwise, if the relay is restricted to switch between the listen and transmit phases at a pre-determined, 
channel independent time, it is called a static protocol. An important example is the static compress-and-forward 
(SCF) protocol of [9], [10]. An HD-RC on which protocols are restricted to be static is called a static HD-RC, 
and one on which they are not, is called a dynamic HD-RC (or simply HD-RC) since any static protocol can be 
thought as a special case of the family of dynamic protocols. On the other hand, the transmit-receive phases on 
an HD-RC can be thought of as states and additional information can be conveyed to the receiver through the 
sequence of these states. A cooperative protocol that uses these states to send additional information is called a 
random protocol, otherwise it is a fixed protocol (see [11], [9]). 

In this paper we focus on the general three-node MIMO HD-RC, i.e., in which there are an arbitrary number of 
antennas at each node and in which there are no constraints on the relay operation so that it can operate via the 
static or dynamic and random or fixed mode. In order to avoid repeated use of a complete descriptor of a channel 
we will use simplified ones when the meaning is unambiguous from the context. For example, we may refer to the 
dynamic MIMO HD-RC sometimes simply as the relay channel because this channel is the central focus of this 
paper. Similarly, we may refer to the MIMO FD-RC or the static MIMO HD-RC as the FD-RC or the static RC, 
respectively, when the meaning is clear. 

In spite of its apparent simplicity, neither the capacity nor the diversity-multiplexing tradeoff [12] of the 3-node 
MIMO HD-RC is known till date. However, in a recent paper [13] the capacity of this channel was characterized 
within a constant number of bits. It was proved that the so called quantize-map and forward (QMF) scheme can 
achieve a rate which is within a constant number of bits to the cut-set upper bound of the channel. On a slow fading 
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HD-RC however, the instantaneous end-to-end mutual information, and therefore the cut-set bound of the channel, 
is a random quantity. A meaningful measure of performance on this channel is hence the outage probability which 
is a measure of reliability as a function of the communication rate in that it represents the (minimum) probability 
with which a particular rate cannot be supported on the channel. The result of [13] provides upper and lower bounds 
on this outage probability, both in terms of the instantaneous cut-set upper bound of the channel, denoted as C(H) 
for a channel realization of H, as 

Pr{(7(H) <R}< V out < Pr{C(n) <R-k}, 

where R is the operating rate and n is a positive constant independent of the channel parameters and the signal- 
to-noise ratio (SNR) of the channel (e.g., see Theorem 8.5 in [13]). The exact evaluation of the outage probability 
requires both these bounds to be tight which in turn requires the exact capacity of the channel. Instead, in this paper 
we focus on the asymptotic (in SNR) behavior of the tradeoff between rate and reliability as captured by the DMT 
metric, first introduced in the context of the point-to-point MIMO channel by Zheng and Tse in [12]. Since it was 
proved that a random protocol can increase the capacity by at most one bit in [9], there is no distinction between 
random and fixed protocols in the DMT framework. Thus, from the DMT perspective, characterizing the DMT of 
the HD-RC by allowing for dynamic operation of the relay but restricting it to the fixed mode still amounts to 
characterizing the fundamental DMT of the HD-RC. It is noted that the DMT of the static MIMO HD-RC was 
recently obtained by Leveque et al in [10]. It is shown here that in general a restriction that relay protocols be 
static fundamentally limits DMT performance over the MIMO HD-RC. 

Since its first introduction and pioneering work by Van der Meulen [14] and the subsequent significant progress 
made by Cover and El Gamal in [15], the relay channel and its more general versions have been analyzed from 
both the capacity perspective in [16], [7], [17], [18], [13], [19] and from the diversity, or more generally, the DMT 
perspectives for the 3-node relay network in quasi-static fading channels in [20], [21], [22], [23], [8], [24], [9], [25], 
[10], [1]. The earliest works demonstrating the improved reliability of the relay channel in terms of the diversity 
gain compared to the corresponding point-to-point (PTP) channel were reported in [20], [21], [22], [23], where a 
number of simple cooperative protocols were proposed and their DMT performance was analyzed. Later in [23], 
[8], [24], more efficient protocols were introduced. Notable among these were the dynamic decode-and-forward 
(DDF) protocol which is DMT optimal on a single antenna relay channel for a range of low multiplexing gains and 
the so called enhanced dynamic decode-and-forward (EDDF) of [24]. All of the above protocols were analyzed for 
the relay channel with single antenna nodes ([23] considers multiple antennas at the destination). 

Multiple antenna relay channels were studied by Yuksel and Erkip in [9], where the DMTs of a number of 
protocols were evaluated and the DMT optimality of the compress-and-forward (CF) coding scheme of [15] for 
the MIMO FD- and HD-RCs was proved. In the DCF protocol of [9], the relay node utilizes all the instantaneous 
channel realizations, i.e., global CSI, to compute the quantized signal and the optimal switching time of the relay 
node. However, global CSI at the relay is not necessary to achieve DMT optimal performance as we discuss next. 

The static QMF protocol of [13] can achieve the cut-set bound of the HD-RC to within a constant gap that is 
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independent of CSI and SNR for a. fixed scheduling of the relay, i.e., for an a priori fixed time td at which the relay 
switches from listen to the transmit mode (e.g., see Theorem 8.3 in [13]), and do so without knowledge of CSI 
at the relay. However, on a slow fading dynamic HD-RC the cut-set bound, denoted by C(T-L,td), is a function of 
global CSI including td and hence the optimal switching time t* d that maximizes the cut-set bound can be a function 
of the instantaneous channel matrices. If (just) this switching time information is hence available at the relay node 
it then follows that the QMF protocol can achieve a rate that is within a constant gap to C{T-L,t* d ) without requiring 
global CSI at the relay. Henceforth, the QMF protocol that operates with a dynamic (channel-dependent) switching 
time of the relay node will be referred to as the dynamic QMF protocol. Since a constant gap is irrelevant in the 
DMT metric, the dynamic QMF protocol in which the relay switches from listen to transmit modes at t* d achieves 
the fundamental DMT of the MIMO HD-RC with only knowledge of t* d at the relay, as opposed to global CSI 
% required by the DCF protocol of [9]. The above discussion shows that the DMT of the static MIMO HD-RC 
found in [10] for the static CF protocol requiring global CSI at the relay also applies to the static QMF protocol 
and hence to the static MIMO HD-RC without any CSI at the relay. 

In this paper, we are interested in establishing the DMT of the dynamic MIMO HD-RC. While the optimality 
in the DMT metric of the DCF was shown in [9] and that of the dynamic QMF protocol [13] is evident from 
the discussion above, the characterization of this optimal performance, i.e., the fundamental DMT of the MIMO 
HD-RC, is not yet known and is the subject of this paper. The key mathematical tool that prevented its computation 
thus far is the joint eigenvalue distribution of three mutually correlated random Wishart matrices. Here we obtain 
this distribution as a stepping stone to characterizing the DMT of the MIMO HD-RC. Not only is this distribution 
result interesting in its own right as a problem in random matrices, it is also arises in establishing the DMT of the 
MIMO interference channel as was done by the authors in [26], [27]. 

The explicit DMT of the MIMO HD-RC evidently would serve as a theoretical benchmark against which the 
performances of the various cooperative protocols proposed and analyzed in the literature can be compared. Further, 
cooperative protocols which are suboptimal but cost-efficient provide the system designer with an option to trade 
performance and complexity if their performance loss can be quantified relative to optimal performance. Moreover, 
the answers of a number of interesting and open questions hinge on the explicit characterization of the DMT of 
the MIMO HD-RC. For instance, while the DMT of the MIMO FD-RC is an upper bound to that of the MIMO 
HD-RC, it is not known whether the latter is strictly worse than that of the former. The question is especially 
intriguing in light of the result by Pawar et at in [28] where it was shown that the DMT of the single-antenna 
(or single-input, single-output (SISO)) HD-RC is identical to that of the FD-RC. Comparing with the DMT of the 
MIMO FD-RC which was found in [9], this question can be resolved if the explicit DMT of the MIMO HD-RC can 
be characterized. There are also open questions regarding the comparative performances of the static and dynamic 
MIMO HD-RCs. Although intuitively it seems that the dynamic HD-RC should have a better DMT than the static 
HD-RC, there is no theoretical proof of this thesis to date. For instance, in the SISO case, there is no difference 
in the DMTs of the static and dynamic HD-RCs as shown in [28] because the DMT of the static QMF protocol 
coincides with that of the SISO FD-RC. The question here is whether this result continues to hold in the more 
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general static and dynamic MIMO HD-RCs. This question can be answered if the DMT of the (dynamic) MIMO 
HD-RC were to be found, since in this case, one could simply compare with the DMT of the static MIMO HD-RC 
of [10]. 

This paper answers the two questions raised above in the negative. In particular, the results of this paper, examples 
from which are shown in Fig. 2(a) and Fig. 2(b) depicting the DMTs of the HD-RC with single-antenna source and 
destinations but with two and four antennas at the relay (the (1, 2, 1) and (1, 4, 1) RCs (applicable for example, to 
CN 3 of Fig. 1(c)), respectively, show that in general neither is the DMT of the static MIMO HD-RC always equal 
to that of the corresponding dynamic MIMO HD-RC, nor is the DMT of the MIMO FD-RC always identical to 
that of the corresponding MIMO HD-RC. 




0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 

Multiplexing gain ( r ) Multiplexing gain ( r ) 

(a) (1, 2, 1) relay channel (b) (1, 4, 1) relay channel 



Fig. 2: Comparison of DDF and SCF protocol with the fundamental DMT of the (l,k,l) relay channel. 

Besides resolving the above discussed problems, the explicit DMT computed in this paper provides sharper 
answers about the MIMO HD-RC. Denoting an RC with to, k, and n antennas at the source, relay and destination 
as the (m, k, n) RC, they include, but are not limited to the following: 

• While in general the DMT performance of the MIMO HD-RC is inferior to that of the corresponding MIMO 
FD-RC, it is found that for two classes of channels, namely (a) the (to, k, n) RCs with m > n > k and (b) the 
(n, 1, n) RCs, the DMTs of the HD- and FD-RCs are identical (see Remark 11). Therefore, for these classes 
of RCs, an FD relay does not improve the DMT performance over that of the corresponding HD-RC. 

• It is well known from [9] that the fundamental DMT of (to, k, n) FD-RC is given by 

min{^ +fe) n (r),^ p (ti+fc) (r)} , < r < min{TO,n}, (1) 

where cff*|(r) represents the DMT of the px q MIMO point-to-point channel [12]. From this it is clear that an 
additional antenna at the relay node strictly improves the DMT performance of an FD-RC at all multiplexing- 
gains. However, this is not true for the MIMO HD-RC. When k is large enough, an extra antenna at the relay 
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node does not further improve the DMT of the HD-RC for high multiplexing gains (see Remark 5). 
• In general, for a set of high multiplexing gain values, the optimal DMT of the MIMO HD-RC channel can 

be achieved without CSI at the relay node. As the number of antennas at the relay node increases, the size of 

this set increases (e.g., see Fig. 6(b)). 
. Finally, it is proved that the DMT of the (l,fc, 1) and the (n, l,n) HD-RC can be achieved by the QMF 

protocol without CSI at the relay node, i.e., neither global CSI nor even the switching time information is 

necessary at the relay node. To the best of our knowledge, this is the first result regarding the achievability of 

the DMT of a non-SISO HD-RC without CSI at the relay node. 
The rest of the paper is organized as follows. In Section II, we describe the system model and provide some 
preliminaries including the asymptotic joint distribution of the eigenvalues of three specially correlated random 
matrices which will be used later to derive the fundamental DMT of the MIMO HD-RC. In Section III, we 
characterize the fundamental DMT of the MIMO HD-RC as the solution of a simple optimization problem in three 
steps: 1) in Subsection III-A we obtain an upper bound on the instantaneous capacity; 2) in Subsection III-B, we 
obtain a lower bound on the instantaneous capacity as the achievable rate of the dynamic QMF protocol, which is 
within a constant gap from the upper bound, and finally, in Subsection III-C, we characterize the DMT as a solution 
to an optimization problem, which we subsequently simplify to a 2-variable optimization problem. In Section IV, 
we provide closed-form expressions for the DMT of different classes of MIMO HD-RCs including the class of 
symmetric (n, k, n) RCs and then prove in Section V that the DMT of (1, k, 1) RC and (n, 1, n) HD-RC can be 
achieved without CSI at the relay node. Section VI concludes the paper. 

Notations: (x) + , x Ay, \S\, det(X) and (Xy represent the number max{a;, 0}, the minimum of x and y, the 
size of the set S, the determinant and the conjugate transpose of the matrix, X, respectively. The symbol diag(.) 
represents a square diagonal matrix of corresponding size with the elements in its argument on the diagonal. /„ 
represents an n x n identity matrix. We denote the field of real and complex numbers by R and C, respectively. 
The set of real numbers between r% £ M and ra(> ri) G K will be denoted by [ri,r2]. The set of all n x m 
matrices with complex entries is denoted as C nxm . The distribution of a complex Gaussian random vector with 
zero mean and covariance matrix S is denoted as CAf(0, £). The trace of a square matrix A, is denoted as Tt(A). 
A >z B (or A >- B) would mean that (A — B) is a positive-semidefinite (psd) matrix (or positive-definite (pd) 
matrix), respectively. Pr(£) represents the probability of the event £ . All the logarithms in this text are to the base 
2. Finally, any two functions f(p) and g(p) of p, where p is the signal-to-noise ratio (SNR) defined later, are said 
to be exponentially equal and denoted as f(p)=g{p) if, 



lim 



log(/(p)) 
log(p) 



= lim 

p— foo 



log(g(p)) 
log(p) 



(2) 



< and > signs are defined similarly. 
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Fig. 3: System model of the MIMO 3-node relay channel. 

II. System model and preliminaries 

We consider a quasi-static, Rayleigh faded MIMO HD-RC, with a single relay node as shown in Fig. 3. It is 
assumed that the source, destination and relay nodes have m, n and k antennas, respectively. Let Hsr G C fcxm , 
Hsd G C™ xm and Hrd € C nxfc model the fading channel matrices between the source and relay, source and 
destination and relay and destination nodes, respectively. For economy of notation, the set of these channel matrices 
is denoted as %. It is assumed that these matrices are mutually independent and their elements are independent and 
identically distributed (i.i.d.) as CAf(0, 1). 

The channel matrices remain constant within a block of Lb channel uses, where Lb is the block length of the source 
codeword. Suppose that during the first tdLb symbol times the relay node only listens to the source transmission 
and during the remaining (1 — td)Lb symbol times it transmits its own codeword X r G C fcx< - 1_tci - )L6 , where 
td G (0, 1). In what follows, the listening phase and the transmitting phases of the relay node will be denoted by 
Pi and p2, respectively, and the fraction td is called the switching time. Since the relay node operates dynamically, 
this switching time should be chosen to maximize the end-to-end instantaneous mutual information and can thus 
depend on all of %. 

We assume that the destination and relay nodes have global CSI H, but the source node does not have any CSI. 
The relay node in this channel model is more capable than that on a relay channel with only receive CSI (CSIR) 
at all the nodes. 1 We assume short term power constraints at the source and relay, i.e., these nodes cannot allocate 

'Note that the DMT of the relay channel with only CSIR at all the nodes can not be better than the DMT of the relay channel considered in 
this paper, since the relay node can choose to not use any CSI except H^r- In fact, intuitively it seems that the latter may use the additional 
information (Hgjj and Hrd) at the relay for instance to optimize the time to switch from its listening mode to the transmit mode and achieve 
a better DMT than the former, on which the switching time can only be a function of H^a- Interestingly, in this paper we shall prove that 
depending on the number of antennas at different nodes the DMT of the above two relay channels (with and without global CSI at the relay 
and destination node) can be identical, cf. Section V. 
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power across different fades of the channel as a function of "H, see equation (6). 

Further, we also assume that the source and relay nodes transmit information at fixed rates; in particular, the 
relay node does not use the available transmit CSI (CSIT) to transmit information at a variable, channel dependent 
rate. Note that an information outage can be avoided on a communication link if CSIT is used to allocate power 
across different fading blocks (cf. [29], [30], [31]) under the so-called long-term power constraint and/or transmit 
information at a variable rate as a function of the instantaneous channel realizations. It was shown in [31] that the 
DMT of a point-to-point MIMO channel can be improved by either of these two techniques. 

Denoting the signals transmitted by the source at time t in phases one and two as Xg 1 [t] and X$ 2 [t], respectively, 
and the signal transmitted by the relay at time t as X R [t] (in phase two), the received signals at the destination and 
relay nodes in phase one are given as 

Y Dl [t] = HsdXsM+ZdM, (3) 
Y R [t] = H SR X Sl [t} + Z R [t], (4) 

and the received signal at the destination node in phase two is given as 

Y D2 [t] = H SD X S2 [t] + H RD X R [t] + Z D2 [t], 

where Zu^t], Zn 2 [t] 6 C nxl and Z R [t] e C hl represent mutually independent additive noise random vectors 
at the destination and relay nodes, respectively. All the entries of these random vectors are assumed to be i.i.d. 
£/V(0, 1). The power constraints at the relay and the source nodes are 2 



j- J Tr (E (X Sl [t]X Sl [t]t)) + £ Tr (E (X S2 [t]Xs 2 [«]*)) 1 



< p; (5) 



1 



]T Ti(E(X R [i\X R [tf)) <p. (6) 



Let {C(p)} be a sequence of codebooks, where for each p the corresponding codebook C(p) consists of 2 LbR ^ 
codewords, each of which is a m x Lb matrix satisfying equation (5). The sequence of codebooks is said to have 
a multiplexing gain of r if 

hm - — — = r. 

p^oo log(p) 

Further, suppose for such a coding scheme C(p), P e (C(p),p) represents the average probability of decoding error 
at the destination node (averaged over the Gaussian noise, channel realizations and the different codewords of the 
codebook) at an SNR of p, then the optimal diversity order of the channel at a multiplexing gain r is defined as 

cT(r) 4 lira ^1 , (7) 

2 Allowing distinct powers at the source and relay nodes of p and cp, respectively, where c is a constant independent of p, does not alter the 
diversity-multiplexing tradeoff. We assume c = 1 for ease of disposition. 
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where P e *(p) represents the minimum average probability of error achievable on a relay channel minimized over 
the collection of all possible coding schemes, 'if(p), i.e., 

^^A^^ (8) 

In Subsection III-C, we shall show that the optimal diversity order at a multiplexing gain of r can be written as 

d (r) = km * ^— — — (9) 

p-s-oo lOg(p) 

where r*(a, j3, 5) is given by (50) in Subsection III-C and a, (3 and 6 are vectors containing the negative SNR expo- 
nents of the eigenvalues (see equations (14)-(16) in the following section) of HsdH\ d , H sR(I m + pH\ D H sd) -1 ~H\r 
and H RD {I n + pHsoHgjj) -1 Hrd, respectively. Evidently, to evaluate the DMT, we need the joint probability 
density function (pdf) of the eigenvalues of these matrices, which we obtain next. 



A. Joint eigenvalue distribution of three mutually correlated Wishart matrices 

Let us denote the matrices H S dH\ d , H S R(I m + pH^Hsd)^ 1 #g fl and H RD (I n + pH S dH\ d )~ 1 H rd as 
Wi, W2 and Ws, respectively. It is evident from their structure that these matrices are not mutually independent. In 
general, finding the joint pdf of 2 or more mutually correlated random matrices is a difficult problem in the theory 
of random matrices. However, in this section we show that by exploiting the specific structure of these matrices 
and the distribution of the constituent matrices, we can compute a closed form expression for the joint pdf of their 
eigenvalues. 

Let < \ u < ■ ■ ■ < Ai, < p p < ■ ■ ■ < pi and < j q < ■ ■ ■ < 71 be the ordered non-zero eigenvalues with 
probability 1 (w.p. 1) of W±, W2 and W3, respectively. Define A = [Ai, • ■ • A u ], p = [pi, ■ ■ ■ p p ] and 7 = [71, ■ ■ ■ 7 g ] 
with u = min{m, n}, p = min{m, k} and q = min{n, k}. It is convenient to denote the joint pdf of the three 
sets of eigenvalues as Fw 1 w 2 w 3 (A, p, 7) and similarly their marginal and conditional pdfs, i.e., the marginal pdf 
of A is denoted as i*Vi(A), the conditional pdf of p conditioned on A is denoted as F W2 \ Wl {p\\), etc. Consider 
the following lemma which provides the first step towards simplifying the problem at hand. 

Lemma 1: The eigenvalues of W-x are independent of the eigenvalues of W3 given the eigenvalues of W\, i.e., 

F W2 w 3 \ Wl (p^\\) = F W2 \ Wl (p\\)F W3 \ Wl {^\\). (10) 

Proof: The proof is provided in Appendix A. ■ 
Using the above lemma, the joint pdf of the eigenvalues of the three matrices can be expressed as 

F WlW2 w 3 (X,fi,l) = F W2 \ Wl (p\X)F W3 \ Wl (j\X)F Wl (X)- (11) 

The joint pdf of the eigenvalues of W\, which is a central Wishart matrix, can be found for example in [32] whereas 
the conditional pdfs F W2 \ Wl {p\X) and ^V 3 |Wi(7l^) involve complicated functions such as determinants whose 
components are hypergeometric functions of the eigenvalues (e.g., see the proof of Theorem 1 in [25]). However, 
since we are interested only in a high SNR analysis, it is sufficient to obtain i 7 ^^! (a*|A) and i 7 ^!^ (7IA) exactly 
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just up to their SNR exponents, i.e., approximate expressions which have the same SNR exponents as the exact 
joint pdf. For this purpose, we use the following theorem from [25]. 

Theorem 1 (Theorem 1 in [25]): Let H x e C N2XNl and H 2 E C N2XNs be two mutually independent random 
matrices with independent, identically distributed (i.i.d.) CAT(0, 1) entries. Suppose that £i > £2 > • • • £t> > 
and Ai > A2 > ■ • ■ A u > are the ordered non-zero eigenvalues w.p. 1 of V% = h\{In 2 + pH^H^) -1 H\ and 
V2 — H2HI, respectively, with u = min{./V2, -/V3} and v = minjiVi, N2}, and where all the eigenvalues are 
assumed to vary exponentially with SNR in the sense of equation (14). 3 Then, the conditional asymptotic pdf of 
the eigenvalues £ given A is given as 

v («,«) u v {N 2 -j)AN 3 . -pt\\ 

hm)^u^ Ni+N2 ~ 2j) ^ j ) n (e-^on^+^n n ( i x ' - )• 

3=1 ( ni =l,n 2 =l) i=l j=l i=l ^ Kj 1 ' 

{(n 1 +n 2 ) = (N 2 + l)) 

Note that the above theorem gives the conditional pdf of the joint eigenvalues of V\ given the eigenvalues of 
V2 exactly up to its exponential order. This asymptotic distribution is simpler to obtain than its exact counterpart 
and is also sufficient for the DMT analysis. It can be easily verified [33] that the first product term corresponds 
to the joint distribution of the eigenvalues of h\H\. The additional three product terms appear because V\ is a 
Wishart matrix with a non-identity co variance matrix. To see this, note that V\ converges to H\Hi if each of the 
eigenvalues of pH2H\ tends to zero. Indeed, putting p\i — > 0, Mi in the above expression, it is easily shown that 
the last three terms converge to 1 giving the joint distribution of h\H\. 

Clearly, Theorem 1 can be used to derive the asymptotic conditional joint pdf of the eigenvalues of W3 given 
the eigenvalues of W\. Consequently, we have 

q ("><?) u q (n-i)Am , -p7,A\ 

F W3 \ Wl m=\{{i\ k+n - 2i) e^) n K p7 ' A -)ii( i +^) fc n n ( ' A )• (i2) 

i=l (»=l,i=l) i=l (=1 i=l ^ 1 ' 

((i+/)=(n+l)) 

Next, since for each realization of Hsd the eigenvalues of HsdH'sd and Hg D HsD are the same, the conditional 
joint pdf of the eigenvalues of W% given the eigenvalues of W\ can also be derived from Theorem 1 and is hence 
given as 

P Kp) u p (m-j)An . - P u, 1 \ i \ 

F W2lW Am=u(»? +rn - 2j) e-") n (e-^ A -)ii( i +^) fe n n ( e , A . -(m 

j=l (i=l,j=l) i=l j=l i=l ^ 1 ' 

((i+j')=(m+l)) 

Substituting equations (12) and (13) in (11) and importing the expression for Fyf x (X) from [32] the joint pdf of 
(X,p,, 7) up to its exponential order can be obtained. 
Recall next that for the DMT analysis we need the joint pdfs of the negative SNR exponents of these eigenvalues, 

3 This assumption greatly simplifies an otherwise very complicated expression of the pdf. Further, in the context of the problem being analyzed 
in this paper, the usefulness of this assumption will be evident shortly. 
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i.e., those of the transformed variables (a,/3,5) defined via 

Aj = p~ a% , 1 <i <u- 
t* j = p- f3i , l<j<p; 



li = P 



-6, 



1 < I < q. 



(14) 
(15) 
(16) 



Using the above change of variables in equation (11) we get the joint pdf of (a,P, S) (where each vector in this 
triple is simply the vector of the corresponding random variables), denoted as fw!W 2 w s (<5, P, S), given as 



fw 1 w 2 wA a ^^) = ( n i=il J (Mj)l) 9w 2 \w 1 (P\a) (n q l=1 \J(li)\) gw 3 \wA6\a) {U^ =1 \J(\i)\) g Wl (a), 



(17) 



where (/3|a), gw 3 \Wii?\°^) an d 9Wi(&) are obtained by replacing the three sets of arguments in (A, /2, 7) 

using the transformations (14)-(16) in F W2 \ Wl (/2|A), F W3 \ Wl (7(A) and Fyp 1 (A), respectively. The quantities J(A,) = 
—p~ a ' In p, J(pj) = —p~ l3j In p and J (7;) = —p~ S ' In p represent the Jacobians of the transformations in equations 
(14)-(16), respectively. 

We next evaluate the three sets of products of Jacobians and the associated g functions in the overall product ex- 
pression in the right hand side of equation (17) up to exponential order. We begin with (n? =1 1 J(pj)\) gw 2 \Wi (P\®) 
first. Using the transformations (14) and (15) in equation (13) we get 

(u,p) u 

n (e-'^'-^na+p- 04 )* 

J=l (t=l,2=l) »=1 

(s.t. i-\-j= m+1) 

p {m—j)/\n 

n n 



ffw a iw!(^ia)= n ( 



3 (fe+m-2j)ft g-p 



1 - e""' 



i=i 



(18) 



For asymptotic SNR (p — > 00) we have 

0, if Pj < for any 1 < j < p; 

0, if {on + Pj) < 1 for any (i + j) > (m + 1); 



lim e p — yj, 11 HJ 

p— >00 

lim e _p ^ 

p— >00 



(19) 
(20) 



lim 

p— »oo 



(1-3,- ' 



(l_ ft ._ Qi ) 



l-e~ x 



lim 



= 1 



(21) 



p-V-Pi-cH) if + on) < 1; 
1 otherwise; 

Substituting the above asymptotic approximations and the fact that the limiting value of a product of convergent 
sequences is equal to the product of the individual limiting values in equation (18), we get 

p -E 2{a ,0) : if (aJ)eS 2 ; 



0, otherwise, 

where S 2 = {(a, p) : < at < ■ ■ ■ < a u ; < ft < ■ ■ ■ < p p ; (Pj + a,) > 1, V(i + j) > (m + 1)} and 



(22) 



E 2 (a,p) 



a . p 



J2(m + k- 2j + l)pj - k ^(1 - ati) + + J2 0- ~ a i ~ Pi)' 



3=1 



(=i 



j-\-i<m 



(23) 
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Similarly, it can be shown that 

\ P - Ei{&rs \ if (a,S) e 5 3 ; 
0, otherwise, 

where S 3 = {(a, /3) : < at < ■ ■ ■ < a u ; < Si < • • • < 5 q ; (Si + q.) > 1, V(i + > (n + 1)} and 



(24) 



E 3 (a,6) 



+ k-2l + l)6 t -k^2(l - a t ) + + i 1 ~ a * 



4=1 



U = l 

Z+z<n 



(25) 



Finally, using the expression for the pdf of a given in [12] we have 



(U^lJiX^l) g Wl (a)= 



P 



-Er=i(™+«- 2 *+iK, if < a x < ■ ■ ■ < a v 



(26) 



0, otherwise. 
Finally, substituting equations (22), (24) and (26) into equation (17) we get the main result of this section, namely, 
the joint distribution of (a, f3, 7) up to exponential order, which we state in the following theorem. 

Theorem 2: If a, f3 and 7 are the vectors containing the negative SNR exponents of the ordered eigenvalues of 
the matrices W\, W2 and W3, respectively, as defined in the transformations (14)-(16), then the joint distribution 
of (a, (3, 7) is given up to exponential order as 

0, if (a,P,$)tS, 



fw 1 w 2 w 3 ( a i 



(27) 



where 



S = S 2 n S 3 = < 



and 



(Oi + ft) > 1, V(i + j) > (m + 1); 
(ai + Si) > 1, V(i + > (n + 1); 
(a,/3,5): <«!<■•■<«„, 

< /3i < • • • < /3 P , 
< 5 X < ■ • • < S q , 



E{a, (3, S) = J2( n + m-2i+ l)a t + j^(fc + m - 2j + + ^(fc + n - 21 + l)Sj 



(28) 



3=1 

II .p 



1 = 1 



(!.<j 



(29) 



-2fc^(l-a l )++ ^ (1- a, -/?,)++ ^ 

j-\-i<7n l-\-i<n 

III. DMT OF THE MIMO HD-RC 

Assuming global CSI TL at the relay node, it was proved in [9] that the DCF protocol based on the CF scheme of 
[15] can achieve the DMT of the MIMO HD-RC. The actual DMT was however not obtained therein. Here, using 
the asymptotic eigenvalue distribution result of Theorem 2 of the previous section, the DMT of the MIMO HD-RC 
is first characterized as the solution of a convex optimization problem (see Theorem 3) and then simplified to a 
two-variable optimization problem (see Theorem 4). Moreover, since it is shown that the dynamic QMF protocol 
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achieves this fundamental DMT, only knowledge of the optimal switching time is required at the relay to achieve 
the DMT of the MIMO HD-RC. This is in contrast to the DCF protocol of [9] which requires global CSI at 
the relay. Later in Section V, it is proved that that even the switching time information, while sufficient, is not 
necessary under certain conditions. In particular, it is shown that the DMT of the (n, l,n) and (l,k, 1) HD-RCs 
can be achieved without CSI at the relay. This is also the case for more general classes of MIMO HD-RCs but 
only for sufficiently high multiplexing gains. 

To characterize the DMT, we first prove that P*(p) (see (8)), the minimum average probability of decoding 
error achievable on the channel at an SNR of p, is exponentially equal (recall definition in (2)) to the probability 
of an appropriately defined outage event. In Subsections III-A and III-B we derive an upper bound and a lower 
bound for the outage probability, respectively, which are in turn exponentially equal. The lower bound to the outage 
probability is based on an upper bound on the instantaneous cut-set bound of the channel. The upper bound on the 
outage probability is derived from an achievable rate expression of the QMF protocol operating dynamically on the 
relay channel. Finally, analyzing these bounds in Section III-C, we derive the DMT of the channel by computing 
the negative SNR exponent of the outage probability. 

It is well known that on a slow fading point-to-point channel the maximum rate at which information can be 
reliably transferred to the receiver depends on the channel realization, and is hence a random quantity. In what 
follows, this rate will be referred to as the instantaneous capacity of the channel. For a particular channel realization, 
if the rate of transmission is larger than the instantaneous capacity of a point-to-point channel, we say the channel 
is in outage. The same is true for a relay channel, where in addition a relay node helps the end-to-end transmission 
between the source and the destination nodes. Further, on a dynamic HD-RC, the instantaneous capacity of the 
channel also depends on the switching time of the relay node and should be chosen optimally to maximize it. Let 
tdift) represent the optimal switching time and let the instantaneous capacity be denoted as C a {H,id('H)) ■ Using 
this notation we next define the outage event. 

Definition 1 (Outage event): The HD-RC is said to be in outage if for a particular channel realization, H and 
SNR p, and the rate of transmission, i? = r log(p) (in bits per channel use (Bpcu)) is larger than its instantaneous 
capacity. The corresponding outage event is denoted as O, so that 



Let Pr(O) denote the outage probability and let do{r) denote its diversity order, i.e., do{r) = lirrip^oo — ^^7^ ■ 
We have the following lemma. 

Lemma 2: The minimum probability of decoding error achievable on the MIMO HD-RC, P*(p) (see (8)) is 
exponentially equal to the outage probability. Hence the corresponding diversity orders are also equal, so that 



0±{H:C (HMH)) <r\og(p)}. 



(30) 





(31) 



where d*(r) is defined in (7). 

Proof of Lemma 2: The proof is identical to that in [12]. 
In the next section, an upper bound on the DMT of the MIMO HD-RC is obtained. 
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A. An upper bound on instantaneous capacity ( and DMT) 

From the discussion in Section I we have that an upper bound on the DMT for the family of fixed and dynamic 
protocols is also an upper bound on the achievable DMT of any (cooperative) communication scheme on the 
MIMO HD-RC. Thus we restrict attention, without loss of generality, to the family of fixed and dynamic protocols. 
Assuming that the relay node switches from the listening mode to the transmit mode at time td, we have that 
any achievable rate R on the relay channel for which the error probability can be made arbitrarily small is upper 
bounded using the cut-set bounds for the HD-RC [34], [35] so that 

R< max mm{Ic s (td),Ic D (td)} = max t A (32) 

{t d ,P(X s ,X R )} t d 

where C(H,t d ) denotes the cut-set bound of the channel for a given td and 

Ics ^ = t d I (X S \ Y D , YrIpi) + (1 - t d )I (X s ; Y D \X R ,p 2 ) ■ (33) 
Ic»(td)=tdI(X s ;Y D \ Pl ) + (l-t d )I(X s ,X R ;Y D \ P2 ), (34) 

represent the maximum mutual information that can flow across the cuts around the source and destination, 
respectively. 

The following two-part lemma provides (i) upper bounds to both Ic s and Ic n and (ii) a lower bound to C(H, td). 
Lemma 3: i. The cut-set mutual informations Ic s (td) an d Ic D {td) m equations (33) and (34), are upper 
bounded as 

max Ic s {td) < Icsitd) = td log (L s ,rd) + (1 - U) log (L SD ) , (35) 
i p x s ,x R } 

max Ic n (td) < Icn^d) = *d lo S ( l sd) + (1 - U) log {L SR>D ) , (36) 

{Px s ,x R \ 

where H s ,rd = [h™] » H S r,d = [H SD H RD ] and 

L SD = dot (h sd hI dP + /„) , (37) 
Lsr,d = det (pHsr,dH\ r d + I n ) ; (38) 



Lsmd = dot (pH s ,rdHI R d + In+kj ■ (39) 
ii. Moreover, the cut-set bound C(H,td) is lower bounded as follows: 

C(H, t d ) >mm{l' Cs (t d ),l' CD (t d )} - (m + k). 

Proof: See Appendix B. ■ 
Now, continuing from equation (32) we have 

R < r max min {I Cs (t d ), I Cd (t d ) } , 

<maxmin< max Ica(td), max Icn(td) 

{t d } {{P(Xs,X R )} {P(XsA)} 

(a) 



ain {l'c s (td),l'c n (td)} = vcmxRu(td), (40) 
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where in step (a) we used the set of upper bounds from the first part of Lemma 3 and the definition Ru(td) = 
min (l'c s (td), l'c D Note that td in equation (40) can be a function of the channel matrices since we are 

considering a dynamic HD-RC. Since the right hand side of equation (40) is maximized when I c (td) = I Cd (td), 
equating equations (35) and (36) we get the optimal value for the switching time as 



log (l*3JI 

t* d = , r __y aD/ , r — 



Putting this value of t d in equation (40) we get 

log log f i&SS.) 

R < S V V; \ + log (L SD ) ± R h . (42) 

Since any rate up to the instantaneous capacity C (H,id(H)) is achievable, we have C (1-L,td(7i)) < Rjj. This 
inequality when used along with the definition of the outage probability in (30) yields 

O = {U : C (H,i d (n)) < rlog(p)} D {H : R* v < rlog(p)} 4 O v , (43) 

from which we have a lower bound on the outage probability, Pr{0} > Pr{0[/}- Using (31), we then obtain an 
exponential lower bound on the minimum achievable probability of decoding error, and hence an upper bound on 
the DMT as 



P:(p)>Pr{Ou} =j. d*(r)<du(r) (44) 

log(Pr(O c 
p-HX log(p) 



where du(r) is the diversity order of Pr{0j/}, i.e., du(r) = lim 



B. A lower bound on instantaneous capacity ( and the DMT) via the QMF scheme 

Since the instantaneous capacity of a slow fading channel is the supremum of the achievable rates of all possible 
coding schemes, the achievable rate of a particular coding scheme yields a lower bound to it. We first derive such 
a lower bound for the HD-RC by computing the achievable rate of the QMF protocol [13] which when substituted 
in the definition of the outage event results in an upper bound to the outage probability yielding in turn the desired 
lower bound to the DMT. But first a brief review of the QMF coding scheme. 

1 ) The coding and decoding strategies of the QMF scheme: The encoding method at the source node of the 
QMF scheme is a two step procedure involving two different codes. The inner codebook, denoted by T Xs has 2 RT 
mutually independent codewords, i.e., 

where for each w € {1, • ■ • , 2 RT } is a random T length vector with i.i.d. CN(0, 1) components. Each codeword 
of the inner codebook is treated as a symbol of the outer code. To transmit a message U € {1, • • • , 2 }, the 
source node first maps it onto a N length sequence of symbols of the outer code, i.e., Wi,W2, ■ • ■ ,wn- Each of 
these symbols are then encoded into a codeword in T xs , i.e., Wk — > x 1 ^"^ for 1 < k < N. Finally, the message U 
is transmitted over NT channel uses and hence at a rate of R bits per channel use. 
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The relay node operates on each codeword of the inner codebook in two phases (listen and transmit). For the 
first td fraction of each codeword ig 1 "'' the relay node listens to the source transmission. Let y^^ represent the 
received signal at the relay during the listening phase with the transmission of Xg k '. The relay node first quantizes 
y [ ™ k) at the noise floor and randomly maps the resulting quantized signal y^^ to a random Gaussian codeword, 
Xft k \ i.e., = /i?(y^" fc ' ) ), where represents a random mapping scheme. The random vector Xjj is 

then transmitted by the relay during the remaining (1 — td)T channel uses. The same procedure is repeated by the 
relay for all N inner codewords that make up the source codeword. Given the knowledge of the relay mapping, 
/r(.), global CSI H, and the received sequence y^^ G C" xT for k = 1, • • ■ ,N, the destination node decodes the 
message. 

2) Achievable rate of the QMF protocol: Recall that the cut-set upper bound to the instantaneous capacity of 
the channel for a given listen-transmit scheduling of the relay (i.e., fixed td) node was denoted by C(H,td) [34], 
[35]. In [13] it was proved that for a given td, the QMF protocol can achieve a rate R q (H,td) on a relay channel 
with channel matrices H, where 



and t is independent of both the channel matrices and p. The above rate satisfying equation (45) can be achieved 
by the QMF protocol for any given td as long as it is known to the relay node. In particular, putting td = t* d (given 
by equation (41)) in equation (45) we get 



In other words, a rate which is within constant number of bits to C(7i,t d ) can be achieved by the QMF protocol. 
Note that, t* d is a function of the instantaneous channel realizations, T~L (e.g., see equation (41)) and can be computed 
by the relay node since we assume global CSI at the relay node. 
From the second part of Lemma 3 we have 



C(H,t*d) >mm{I Cs (t d ),l' CD (t d )} - (m + k), 
>Ru - (m + k), 

where the last step follows from the fact that Ic s {t*d) = ^-c D (^d) = R-u ( see equation (41)). Now, substituting the 
last lower bound to C(TL,t d ) in equation (46) we get 



R q (n,t d )>c(n,t d ) 



(45) 



R q (n,t d )>c(n,t d ) 



T. 



(46) 



R q {n,t* d ) >R* V - (m + fc + r), 




—Ru ^ Ro, 



(47) 
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where Rq = (m + k + r). 

Clearly, the instantaneous capacity C (%,tdCH)) is larger than or equal to any achievable rate on the channel, 
i.e., C (T-L,td{TL)) > R%. This inequality along with the definition of outage probability in (30) yields 

O = {% : Co (H,i d (H)) < rlog(p)} C {% : R* L < rlog(p)} 4 Q L , 

which in turn implies that P*{p) = Pr{0} < Py{Ol}, where the exponential equality is from (31). Now, since 
R* L = Ry — Rq and Rq is independent of the SNR (p) and H, we have at asymptotically high SNR that 

Pt{O l } =Pr{R*L < rlog(p)} = Pr{R* v - R < rlog(p)} 
=Pr{i?^<rlog(p)}=Pr{0 £7 }. 

Hence, P* (p)<Pi{Ojj} and combining with (44) we have that Pt{Ojj} characterizes P*{p) exactly up to 
exponential order, i.e., 

P:( P )=Pt{Ou} 
so that the DMT of the MIMO HD-RC can be expressed as 

^(r)=^(r)=lim- l0S f r ^» . (48) 

p^co lOg(p) 

with Ojj defined in (43) in terms of R^ which in turn is defined in (42). In the next section, we evaluate this DMT. 

Remark 1: The QMF protocol can achieve the DMT of the MIMO HD-RC with knowledge of only t* d , the 
switching time that maximizes the cut-set bound (in lieu of the true optimal switching time id)-, i.e., it does not 
require the explicit knowledge of Hsr, H$d and Hrd- 

C. The DMT as a solution to an optimization problem 

Evidently, to obtain d*(r) the probability distribution of R^, which is a function of the three channel matrices, 
is needed. However, by simplifying the expression for Ry, it is shown that just the joint eigenvalue distribution of 
the three composite channel matrices Wi, for 1 < i < 3, defined in Section II- A, suffices. Further simplification 
shows that only the joint distribution of the SNR exponents of these eigenvalues is sufficient to obtain d*(r). 

Lemma 4: The optimal diversity order of the MIMO HD-RC can be written as 

d (r) = lim — , (49) 

p-»oo log(p) 



where r*(a,P,S) is given as 

" e?=i(i-«o + e?=i(i-&: 



E?=i(i-*0 + + ELi(i-& 



£(1 - " t ) + ; (50) 



and cti's, /3/s and Si's are the negative SNR exponents of the eigenvalues of HsDHgp, HsR(I m +pHg D HsD) 1 Hsr 
and H^niln + pH S dH\ d )~ x H rd , respectively. 

Proof: The proof is given in Appendix C. ■ 
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Using the joint pdf of {a, (3,5} given by equation (27), Pr {r*(a, 0, 5) < r} is evaluated and using equation 
(49), the optimal diversity order d*(r) is obtained, leading to the following theorem. 

Theorem 3: The solution of the following optimization problem yields the fundamental DMT of the MIMO 
HD-RC: 



min F (a, 0,5) (51) 



subject to the following constraints 



S (1 -°' ) + EpT^ft)T£CT^) ^ ,52) 

a m - j+ i + > 1 VI < j <p, (53) 

On-i+i + Si > 1 VI < I <q, (54) 

< ai < ••• < a u <1, (55) 

< 0! < ■ ■ ■ < P <1, (56) 

< < • • ■ < 8 q <1; (57) 



where 



F (a, 0, 5) = ^2(n + m + 2k-2i + ^(fe + m - 2j + 1)^- + ^(k + n - 21 + 1)S 3 - 2ku 

i=i j=i i=i 

u,p u.q 

+ (1 - «*i - + £ (1 - «i - ^) + - (58) 

i,j=l i,/— 1 

j+i<m l+i<n 

Outline of proof: It is clear from equation (49) that the optimal diversity order is equal to the negative SNR 
exponent of Pr {r*(a 7 0, 5) < r}. Using the joint pdf of (a, 0, 5) obtained in Subsection II- A, this probability can 
be written as an integral of the pdf over the subset of the sample space of (a, 0, 5) where r*(a, 0,5) < r (call it 
T>). From Laplace's method it follows that this integral is dominated by a term having the minimum negative SNR 
exponent over T>. The details are provided in Appendix D. ■ 
Remark 2: It is well known that the fundamental DMTs of the (m, n) and the (n, m) point-to-point MIMO 
channels are identical. From (1) it is also clear that the DMTs of the (m, k, n) and the (n, k, m) MIMO FD-RCs are 
identical. The above theorem proves that this reciprocity property of DMT extends to the MIMO HD-RC as well as 
can be seen from the symmetry in m and n of the optimization problem of (51). In other words, the fundamental 
DMTs on the (m, k,n) and (n,k,m) MIMO HD-RCs are identical. Henceforth, we let m > n without loss of 
generality. 

Note that J2i=i a i> Sj=i &j an d S?=i are affine functions of the cti's, 0/s and 5i's, respectively. Furthermore, 
by computing its Hessian, it can be easily proved that the function ^Zx)+(q-y) ' s convex with respect to x and y. 
Further, since ,f(g(V)) is convex whenever /(.) is convex and g(.) is affine, it is evident that (p-Y^^p^+iq-Y^ 1 s,) 
is a convex function. Hence the left hand side of the inequality constraint (52) is a convex function. It is also clear that 
the objective function in (58) is convex and that all the other constraints (53)-(57) are convex. Thus the optimization 
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problem of Theorem 3 is a convex optimization problem and it can be solved using convex programming methods 
[36]. 

The number of variables in the optimization problem in (51) however, increases with m,k and n linearly. In 
what follows, we show that the problem can be simplified to an optimization problem with only two variables, 
independently of to, k and n. 




(a) Cases where DMTs of FD- and HD-RCs are identical (b) Cases where DMTs of FD- and HD-RCs are not identical 



Fig. 4: DMTs Comparisons for MIMO HD- vs. FD-RCs. 



Theorem 4: The fundamental diversity-muliplexing tradeoff of the (m, k, n) HD-RC is given as 



Sm(r-a) , 
(s„, -r+a) ' Um 



min{p, (to — a)}, 



where the interval 7Z is specified in equation (100) in Appendix E, B 
s m = mm{q, (n — a)} and fa's are as defined in equations (90)-(92) in Appendix E. 

Proof: The proof is given in Appendix E. ■ 

Example 1: We illustrate the advantage of relaying relative to point-to-point communication by considering the 
networks CNi and CN2 of Fig. 1(a) and Fig. 1(b). Fig. 5(a) applies to the uplink of CNi (in which to = k < n) 
and depicts the DMT performance of the relay channel with respect to that achievable on the corresponding point- 
to-point channel. Similarly, Fig. 5(b) applies to the uplink of CN2 (in which to < k < n). These figures clearly 
demonstrate the superior performance of cooperative MIMO over point-to-point MIMO communication. 

Remark 3: The explicit numerical computation of the fundamental DMT reveals several interesting character- 
istics of the MIMO HD-RC. For example, for the class of (to, k, ri) HD-RCs where to > n > k we found that 
the DMT is identical to that of the corresponding MIMO FD-RC. It appears to be difficult however to show this 
analytically. Note that this scenario applies to the downlink of the two networks CNi (with m > n — k) and CN2 
(with m > n > k) of Fig. 1(a) and Fig. 1(b), respectively. Fig. 4(a) illustrates this fact for a few specific examples 
of MIMO RCs. Thus, for the class of MIMO HD-RCs for which to > n > k, the half-duplex constraint does not 
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Fig. 5: The fundamental DMTs of the MIMO HD-RC and the corresponding point-to-point MIMO channel. 

appear to be restrictive in terms of DMT performance. In general however, the MIMO HD-RC has different DMT 
characteristics than the MIMO FD-RC. For instance, see Fig. 4(b) (which is relevant for the sensor network CN3 
of Fig. 1(c)). This will be further discussed in the next section. 

Conjecture 1: For the class of (m,k,n) HD-RCs for which m > n > k the DMT is equal to that of the 
corresponding MIMO FD-RC. 

IV. Closed form expressions for the DMTs of a few classes of relay channels 

A closed form expression of the DMT would provide more insights about the system than a numerical solution. 
Motivated by this fact, we next provide closed-form solutions for the fundamental DMT of special classes of MIMO 
HD-RCs specified by the relationship between the numbers of antennas at the three nodes, including the (n, k,n) 
(henceforth, called symmetric since m = n) HD-RC. 

Theorem 5: The optimal diversity order d*(r), at a multiplexing gain of r, of the (n,k,n) HD-RC is upper 
bounded by dfj (r) (defined below) as 



[ min {1 <i<(3 +2p ) } dut(r), k > n, 
where recall that p = min{m, k} = min{fc, n} (since m = n) and for N = 1, • • ■ ,p, and recalling that dr nti „ r \(-) 
represents the fundamental DMT of a MIMO point-to-point channel with n t transmit and n r receive antennas, we 




(60) 
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define 



d Ul (r) = d( n ,n+k) (r), for < r < n, (61) 
d U2 {r) = d( 2 „,2n)(2r), for n - - < r < n, (62) 

dc/ 3 (r) =n 2 +^](n + fc-2/ + l) (1- ("^-y-Z + l^ J , for < r < |, (63) 



i=l 



/ N\ N f jV iV 2 1 

d C/(3+N)( r ) = ^ + rf(«-iV),(n+2fe-iV) [r- — \ , for — < r < mux in - —, n - — _ — j , (64) 

iVn 
(jV + n) 



(«-JV) + 
rf(3+ P +w)(r) = J! ( 2n + * - JV - 2i+ 1) (l - (ojv - i + 1) + ) + iV 2 , for " < r < n - — , (65) 



and a,v is given by equation (115) in Appendix F. 

Proof: The proof is given in Appendix F. ■ 

Evaluating the exact DMT of the (n, k, n) relay channel using Theorem 4 for several values of n and k we found 
that d*(r) = dfj(r) which leads us to make the following conjecture that the upper bound dfj(r) is in fact tight. 

Conjecture 2: On a symmetric (n, k,n) relay channel d*(r) = d(j(r), where d(j(r) is given by equation (60). 

Remark 4: From the expression of djj 2 (r) in equation (62) we see that this particular upper bound does not 
depend on k for k > n. Thus, when this bound is active, adding an extra antenna at the relay node does not 
improve the DMT performance of the channel. This is an interesting difference between the HD- and FD-RCs since 
for FD-RCs every additional antenna at the relay improves the diversity order for all values of multiplexing gains 
(recall (1)). Empirical results show that djj 2 (r) is a tight bound for the DMT on the (n, k, n) HD-RC for r > j and 
k > r^rl- For example, Fig. 4(b) illustrates this fact by showing that while adding an extra antenna on the (2, 3, 2) 
FD-RC uniformly increases the achievable diversity orders at all multiplexing gains, the achievable diversity order 
on the corresponding HD-RC does not change for r > 1. 

Remark 5: Another interesting fact revealed by Theorem 5 is that, as the number of antennas increases at 
the relay node, the difference in the DMT performance between the FD-RC and the HD-RC increases. From the 
expression of the upper bound du 3 (r) in equation (63) we see that at a multiplexing gain of r = § the diversity 
order achievable on the HD-RC is upper bounded by n 2 but on an FD-RC this is clearly not the case where the 
diversity order increases with k. Fig. 6(a) demonstrates this phenomenon on a (2, k, 2) relay channel which is 
applicable to the CN3 scenario of Fig. 1(c). Intuitively, the above phenomenon occurs because as the number of 
antennas at the relay increases the signal forwarded by the relay node can significantly contribute to enhancing the 
diversity of the received signal at the destination node and hence the half-duplex constraint becomes increasingly 
more restrictive (relative to the FD relay) because the relay node can not transmit in the listening phase. 

Remark 6: A similar argument holds for the comparison between the DMT performance of the static and 
dynamic HD-RCs. In contrast to a static channel, since on a dynamic channel the switching time varies depending 
on the instantaneous channel matrices, it is expected that a larger number of antennas at the relay node make a 
bigger difference on the DMT performance. 
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Fig. 6: A comparison of DMTs on Dynamic vs. Static and HD- vs. FD-RCs. 



The following result gives an explicit formula for the DMT of the (1, k, 1) HD-RC. 
Theorem 6: The optimal DMT of a half-duplex (1, k, 1) HD-RC is given as 

f (fc+l)(l-r), 0<r < j^j- 

d (i,M)W= l + (ITT) < r <b C 66 ) 

[ 2(1 -r), i <r < 1. 

Proof: The proof is given in Appendix G. ■ 
Remark 7: A comparison with the DMT of the class of static (l,k, 1) HD-RCs (derived in [10]), numerical 

examples of which are given in Fig. 2(a) and Fig. 2(b), reveals that the DMT of such static HD-RCs are strictly 

smaller than that for their dynamic counterparts for r < |. 

The next result gives an explicit DMT formula for the class of symmetric HD-RCs with single-antenna relays. 
Theorem 7: The DMT of the (n, l,n) HD-RC is given by a piece-wise linear curve whose corner points at 

integer values of r are given as 

d* n ,l,n)( r ) = (r) 

= (n-r)(n+l-r), 0<r<n. (67) 

Proof: The proof is given in Appendix H. ■ 
Before proceeding to the next section, we summarize the findings of the previous two sections. The explicit 
computation of the DMT of the MIMO HD-RC enabled the proof of the existence of classes of HD-RCs whose 
DMT performances are (a) strictly inferior to that of the corresponding FD-RCs (with certain m, k, ri) and (b) equal 
to that of the corresponding FD-RCs (when m > n > k, see Remark 3). Furthermore, closed-form solutions for the 
two-variable optimization problem are obtained for two classes of symmetric HD-RCs, namely, the (n, 1, n) and the 
(1, k, 1) HD-RCs (see Theorems 6 and 7). More generally, the explicit DMT is upper bounded for the symmetric 
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(n, k,n) HD-RCs and is conjectured to be tight (see Theorem 5 and Conjecture 2). These solutions reveal certain 
special characteristics of the HD-RC which are different from that of the FD-RC and the static HD-RC. For example, 
while an extra antenna at the relay node uniformly improves the DMT for an FD-RC this is not the case for the 
HD-RC. Moreover, the greater the number of antennas at the relay, the greater is the difference between the DMT 
performances of the FD- and HD-RCs. It is also observed that within the class of HD-RCs, static operation of the 
relay in general limits performance relative to unconstrained (or dynamic) operation of the relay with the difference 
in DMT performance becoming more pronounced with an increasing number of antennas at the relay. 

V. ACHIEVABILITY OF THE DMT WITHOUT SWITCHING TIME AT THE RELAY NODE 

In the previous section we established the fundamental DMT for the MIMO HD-RC. It hence represents the 
DMT achievable by the best cooperative protocol among all admissible ones with global CSI at both the relay and 
the destination. The QMF scheme of Section III-B however achieves this fundamental tradeoff with just the relay 
having knowledge of the switching time t* d defined in (41) (with the destination node having global CSI). In this 
section, we explore the question of whether there are situations in which even the switching time knowledge at the 
relay is not necessary. Note that the DCF protocol was shown to be optimal from the DMT perspective in [9] so 
that the DMT of the previous section is achievable by this protocol too but it requires global CSI at the relay. In the 
case of the SISO HD-RC, the QMF scheme of [13] was shown in [28] to achieve, with switching time of 1/2 (and 
hence without knowledge of switching time at relay), the fundamental DMT of the SISO FD-RC which in turn is an 
upper bound for the SISO HD-RC, so that dynamic operation of the relay in this case does not help from the DMT 
perspective. Does this result generalize to MIMO HD-RCs? It turns out that it does in some cases. In particular, we 
show that for the (n, l,n) MIMO HD-RCs the DMT (given by Theorem 7) can indeed be achieved by the QMF 
protocol with a channel independent switching time. In other words, on this class of RCs even global CSI at the 
relay node does not help in terms of DMT performance, generalizing the SISO HD-RC result of [28]. Moreover, 
for the class of (1, k, 1) HD-RCs we show that for all multiplexing gains r 6 [0, 1/2] the optimal tradeoff curve 
can be achieved by the dynamic decode-and-forward (DDF) protocol analyzed for the general MIMO HD-RC by 
the authors in [25] and for all multiplexing gains r e [1/2, 1] it is achieved by the static QMF protocol and neither 
of these protocols requires knowledge of Hsd and Hbd at the relay node. 

We shall show that while on the two specific classes of RCs as specified earlier the optimal diversity order at 
all multiplexing gains can be achieved without any CSI at the relay node, the same is also true in general but only 
for higher multiplexing gain values (e.g., see Fig. 6(b)). 

Remark 8: The DMT of the static MIMO HD-RC was obtained in [10] with the static CF protocol as the 
achievability scheme which requires global CSI at the relay. From Theorem 8.5 in [13] however, we have that on 
the static HD-RC, the QMF protocol, which doesn't require any CSI at the relay node (cf. Section VIII-A of [13]), 
can achieve the instantaneous capacity within a constant gap. Since a constant number of bits is insignificant in 
DMT metric, the QMF protocol can hence achieve the DMT of the static HD-RC. 

Remark 9: If the DMT of the static and dynamic HD-RCs are identical over some range of multiplexing gains 
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then the optimal diversity orders at those values of the multiplexing gains can be achieved without any CSI at the 
relay node. 

Example 2: From Fig. 6(b) it is clear that the optimal diversity order achievable on the (2,2,2) and (2,4,2) 
HD-RCs can be achieved by the QMF protocol without any CSI at the relay for r S [1.5, 2] and r e [1, 2], 
respectively. For all other values of r, the relay node requires the optimal switching time (or t^) to achieve the 
maximum diversity orders achievable by a dynamic protocol. 

We turn our attention now to the MIMO FD-RC. The optimality of the CF protocol in the DMT metric was 
proved in [9] (and the DMT itself was found to be given by (1)) but this protocol requires global CSI as discussed 
earlier. 

Remark 10: It was shown in [13] that the (full-duplex version of the) QMF protocol can achieve the instantaneous 
capacity of the FD-RC to within a constant number of bits. Hence, the DMT of the MIMO FD-RC can be achieved 
by the QMF protocol without any CSI at the relay node and global CSI at the destination node. 

As an immediate application of Remark 8 and Theorem 6 we get the following result. 

Corollary 1: The fundamental DMT of the (1, k, 1) relay channel can be achieved by the DDF protocol, for 
multiplexing gains in [0, |] and by the static QMF protocol for multiplexing gains in the interval [|, 1] . While the 
DDF protocol requires only CSIR, the QMF protocol does not require any CSI at the relay node, i.e., neither the 
DDF nor the QMF protocol requires global CSI at the relay node. 

Proof: Theorem 6 provides the optimal DMT on a (1, k, 1) relay channel. Comparing it with the DMT of the 
DDF protocol on this channel derived in [25] which is restated here for convenience, namely, 

f (ft+l)(l-r), 0<r< pjijy; 

{ (^),|<r<l, 

it is evident that the fundamental DMT of the (1, k, 1) HD-RC can be achieved by the DDF protocol for < r < i. 
Further this DMT can be achieved by the DDF protocol with only the knowledge of Hsr at the relay node. 

On the other hand, it was proved in [10] that the DMT of the static (1, k, 1) HD-RC is 2(1 - r) for ± < r < 1. 
For | < r < 1, this DMT can be achieved by the QMF protocol without any CSI at the relay by Remark 9. ■ 

The key enabling result for Corollary 1 beyond the DMTs of the DDF and the static HD-RC is the explicit DMT 
of the (1, k, 1) HD-RC of Theorem 6. Moreover, to the best of our knowledge, Corollary 1 is the first result on 
the achievability of the DMT of a non-SISO HD-RC without global CSI at the relay node. This result however 
requires two different protocols for the two ranges of multiplexing gains. In this sense, the above result doesn't truly 
generalize the result of [28] in which it is shown that the QMF protocol achieves the DMT of the SISO HD-RC. 

Example 3: Comparing the DMT of the DDF protocol with the fundamental DMT of the (1,2,1) relay channel 
depicted in Fig. 2(b), we see that the DDF protocol is DMT optimal on this channel for a multiplexing gain in the 
range [0, g]. Moreover, since the static DMT is strictly smaller than that of the corresponding dynamic channel in 
this range, the CF and QMF protocols require global CSI and the optimal switching time information at the relay 
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node, respectively, to achieve optimal DMT performance. However, the DDF protocol needs only source-to-relay 
CSI at the relay node. Clearly, the cooperative protocol of choice in this case (i.e., with r £ [0, 5] ) is the DDF 
protocol. 

In what follows, we identify a class of non-SISO RCs, namely the (n,l,n) HD-RCs, on which the DMT of 
the channel can be achieved by a single protocol, namely the static QMF protocol with no CSI at the relay node, 
thereby generalizing the result of [28]. This result is shown by proving that the DMTs of the static and dynamic 
(n, l,n) HD-RCs are identical. In other words, for this class of HD-RCs dynamic operation of the relay does not 
help from the DMT perspective. We start by first finding in closed form the DMT of the static (n, l,n) HD-RC. 

Theorem 8: The DMT of the static and dynamic (n, 1, n) HD-RCs are identical, i.e., is given by 

<1,n)W = rf(„,i, n) W = < P +1)in (r), 0<r<n. (68) 

Hence, the DMT of the (n, 1, n) HD-RC can be achieved by the static QMF protocol with no CSI at the relay node 
(i.e., without the knowledge of even the optimal switching time). 

Proof: The second equality in (68) is just the result of Theorem 7. If the first equality holds, it means that the 
static QMF protocol (which, by Remark 8, achieves the DMT of the static HD-RC without any CSI at the relay 
node) achieves DMT of the dynamic (n, 1, n) HD-RC without any CSI at the relay node. It remains to prove the 
first equality. 

The DMT of the symmetric (n, fc,n) static HD-RC was established as the solution of a convex optimization 
problem in [10] and an analytic expression for an upper bound to the DMT was provided therein. The proof of 
the above lemma is given by obtaining an exact closed form solution to that optimization problem for the case of 
k = 1. Our starting point is thus equation (13) in [10] which is restated here for convenience, 

n n n—1 

where 

n 

T = {(a, ft) : 5^(1 - a j; ) + + -(1 - ft) + < r; < a x < a 2 <, ■ ■ ■ < a n ; < ft; (ft + a„) > l}. (70) 

Using an argument similar to that in the proof of Theorem 3 in Appendix D, it can be shown that a further restriction 
to a„,ft £ [0, 1] can be made without changing the solution of (69) but it greatly simplifies the problem as 

n n—1 

dfnin)( r )= min Yi2n-2i + 2)a l +np l -n + Y{\ -ft -<*)+. (71) 

where 

n 

t = {(a,ft) : £(1 - ai) + -(1 - ft) < r; < a, < a 2 <, ■ ■ ■ < a n < 1;0 < ft < 1; (ft + a n ) > l}(72) 

?:=i 

The rest of the proof that the solution of (71) is given by the lemma follows from induction, the details of which 
are relegated to Appendix I. ■ 
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Remark 11: Note that the DMT of the FD (ra,l,n) RC is also given be d p ^ +1) n (r) [9]. Therefore, on the 
(77, 1, 77) HD-RC, the DMT of the (77, 1, 77) FD-RC can be achieved by an HD relay without any CSI at the relay 
node. 

Remark 12: Although, for a large number of 77 £ N the DMT of the static (77, 1, 77) RC was computed in [10] 
and was observed to be equal to $^+1) n( r )' tne anarvs i s of [10] only proves that djy^ „( r ) represents an upper 
bound to the DMT of the static (77, 1, 77) RC. Therefore, the conclusion of Theorem 8 cannot be obtained from the 
result of [10]. 

VI. Conclusion 

The fundamental DMT of the three-terminal (777, k, n) HD-RC is characterized. This allows an in-depth compari- 
son of half-duplex and full duplex relaying as well as dynamic and static operation of the relay as a function of the 
numbers of antennas at the three nodes. Unlike in the single-antenna relay channel, half-duplex relaying in general 
results in a penalty relative to a full-duplex relaying and an improved performance relative to static half-duplex 
relaying at high SNR performance as measured by the DMT metric. The achievability of the fundamental DMT 
is shown via the dynamic QMF protocol [13] which requires only the knowledge of the optimal switching time at 
the relay. Classes of HD-RCs for which dynamic operation of the relay doesn't improve performance over that of 
static relaying are identified. For such RCs the knowledge of switching time is not needed either. The problem of 
characterizing the DMT of the relay channel with multiple relays is one for future research as is the problem of 
finding finite block-length coding schemes that are DMT optimal. 

Appendix A 
Proof of Lemma 1 

Let the singular value decomposition of Hsd be given as UAqV\ where U G c™x™ an( j y £ $ynxm are 
mutually independent unitary random matrices distributed uniformly over the set of square unitary matrices of 
corresponding dimensions (e.g., see equation (3.9) in [37]). Using this fact we can write 

HsdHI d = UAU\ and hI d H sd = VA 2 V\ (73) 

where the sets of non-zero elements of A2 and A are identical. In particular, given one, the other is fixed. Putting 
this in the expressions for W2 and W3 we get 

W 2 = H SR V (I m + A2)" 1 V^H ] SR = H SR (I m + A?)- 1 H f SR , (74) 
W 3 = H RD U(I n + Ay 1 U^H RD = H RD (I n + A)' 1 H RD , (75) 

where Hg^ = HsrV and Hrd = U^Hrd are mutually independent random matrices that have the same 

distributions as H$r and Hrr>, respectively, since Hsr and Hrr> are unitarily invariant (cf. [33]). Letting 

~ _ 1 ~+ _ i 

A = Hsr (I m + A 2 ) 2 and B = H RD (I n + A) 2 we realize that both A and B still have mutually independent 
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Gaussian entries conditioned on A. Computing the conditional correlation between the two we get 

E(flU|A 2j A) =E^I n + Ay i -H RD H SR (I m +A 2 y^ |A 2 ,a) , 

\l n + A)"* U^H rd HsrV (I m + A a )~* \A 2 ,a) = kxk , 



where the last step follows from the fact that components of H$r and H R o are zero mean and mutually independent. 
This, along with the fact that A and B are Gaussian [38] proves that conditioned on A2 or A, they are independent. 
This in turn implies that W 2 = AA^ and W3 = BB^ are independent given A. Consequently, the eigenvalues of 
W 2 are independent of the eigenvalues of W3 given A. 



Appendix B 
Proof of Lemma 3 

A. Proof of Part i 

The proof consists of upper bounding in two steps the mutual information terms of the form I(X;X + Z), 
subject to a sum power constraint on X. First, we use the fact Gaussian input is optimal, and then in the second 
step, we use the monotonically increasing property of the logdct(.) function in the cone of positive semi-definite 
matrices. 

Suppose Y = HX + Z, where Z ~ CAf(0, 1), H G C NxM and Cov(AT) ^ Q, then it is well known [34] that 



max I(HX + Z;X) = I(HX G + Z; Xq) = log dct (I + HQH^) 

{Cov(X)±Q} y ' 

where Xq ~ CJ\f(0, Q). Similarly, for a sum power constraint we have 

max UHX + Z; X) = max max I(HX + Z; X) 

{Tr(Cov(X))^p} Tr(Q)<p{Cov(X)^Q} 



(76) 



(a) 



max log det (I + HQH^) 

MQ)<p 



<logdet (1 + pHH 



(77) 
(78) 



where step (a) follows from (76) and the last step follows from the fact that Q < pi and log det(.) is a monotonically 
increasing function in the cone of semi-definite matrices. 
Using equation (78) we have 







+ Z D : 


X s 




) 




. Xr . 




. Xr . 





P(X S ,X R ) P(X S ,X R ) 

< log dct (l n + pH SRtD Hl RD j = log(L SRiD ), 
where Hsr,d = [Hsd Hrd\- Using a similar method we obtain 

max I(X s ;Y D \X R ,p 2 ) < log dct (/„ + pH SD Hl D ) = log(L SD ); 

max I{X s ;Y D \pi) < log dct (/„ + pH SD Hl D ) = log(L S L>); 
max 1{X s ;Yr,Y d \px) < logdet (/„ + pH s ,rdHI rd ) = log( J L S;flz? ). 

P(Xs,X R ) \ ' J 
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Finally, substituting the above set of upper bounds in equation (33) and (34) we get 

max I Cs {to) <td log (L s ,rd) + (l-t d ) log (L S d) - Ic s (*<*)> 

P(Xs,X R ) 

max I Cd {td) <td log (Lsd) + (1 - U) log (L S r,d) = I Cd (*<«)• 

P{Xs,X R ) 

This proves the first part of the lemma. 



B. Proof of Part ii 

Let P* represent the distribution where X s ~ CAf(0, -^I m ) and X R ~ C7V(0, and X s and X fl are mutually 
independent. Note that P* satisfies the input power constraints at the source and relay given in (5) and (6). Then 
denoting the mutual information I{Xs,X R ;Yo\p2) evaluated at P* by I(Xs, X R ; Yd \p2) 

I(X s ,X R ;Y D \p 2 

= logdet (l„ + £-H SD Hl D + ^H RD H RDl 





X s 


+ z D - 


X s 








) 




Xr 




Xr 





(«) ( Ir, 

> log dct 



-H SD H SD 



m + k m + k m+ k 

-- logdet (l n + pH S R.DHl R D j -(m + k), 

=l0g(L Sj R,£)) - TO 



H RD H RD 



(79) 



where step (a) follows from the fact that logdct(.) is a monotonically increasing function in the cone of positive 
semi-definite matrices. Using a similar method we get 



I(X s ;Y D \X R ,p 2 ) pt >logdet (J„ + pH SD Hl D ) - to = log(L SD ) - to; 



7(X 5 ; Yd|pi) ^ >logdct (/„ + pH SD Hl D ^ - to = log(L s _D) - to; 



^(^55 Y Rl Y D \pi) ^ >logdct [I n + pH s , RD Hl RD j - to. = log(L S M D ) 
Now, from the definition of C(H,td) in equation (32) we get 

C(%td) = max mm{I Cs (t d ),I CD (td)}, 

{P(X s ,Xr)} 

(a) 

> max mm {I Cs (td), Ic D (td)} , 
{P(Xs ,X R )=P*} 



(80) 
(81) 
(82) 



('<) 



: min|/ Cg (i d ) ,Ic D {td) }> 
>min{/ Cs (t d ) - m,I Cn (t d ) - t d m - (1 -*d)(m + fc)}, 
>min{Ic g (td),Ic D (td)} - (TO + fc), 

where step (a) follows from the fact that instead of maximizing over all possible input distributions, we are 
evaluating the right hand side of the equation at a particular distribution P* and in step (b) we substituted the set 
of lower bounds from equations (79)-(82) in the expressions for Ic s (td) and Ic D (td)- 
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We will prove that 



Appendix C 
Proof of Lemma 4 

R* u = r*(aJ,S)log(p), 



which, when substituted in equation (48), proves the lemma. From the expression of Ls.rd in equation (39) and 
using elementary properties of determinants, we have 



log(L s ,RD) = logdet (l n+ k + pH s . RD Hl RD j , 

= logdet (l m + pHl RD H SMD ) ( = logdet (l m + pHl R H SR + pHl D H SD 



log det ( I m + pHl R H S R (l m + pHI d H S d) ) + log det (l m + pHl D H SD ) , 



logdet I k + pH SR [I m + pHl D H SD ) h\ r + \og(L SD ) 



where in step (a) we have used the fact that H s RD = [Hg R H SD ). Hence, we have 



log 



^S,RD 



D 



= log det I k + pH SR (l m + pHl D H SD 



H 



SI! 



Similarly, from the expression for Lsr,d in equation (38) we get 

Lsr,d 



log 



L 



so 



= logdet I k + pH RD (l n + pH SD H 



S D 



SI! 



(83) 



Now, assuming < \ u < ■ ■ ■ < Ai, < p, v < ■ ■ ■ < \x\ and < 7 9 < • • ■ < 71 represent the ordered non- 
zero (w.p. 1) eigenvalues of the matrices W\ = HsdH\ d , W2 = HsR{I n + pHg D HsD)~ l H SR and W3 = 
H RD (I n + pHsdH\d)~ 1 Hrd and substituting these in equation (42) we get 



log 


(n?=i(i+p^) 


)io g (n?=i(i+PTi)) 


log( 


n?=i(i+p^)) 


+iog(nLi(i+P7o) 



log! JJCi + poi) , 



(84) 



To further simplify the expression on the right hand side of the above equation we use the following transformations: 

A, = p~ ai , 1 < i < u, pbj = p~P j , 1 < j < p and 7; = p~ Sl , 1 < I < q and get 

" ELi(i-^) + E^i(i-ft) + 



r*, 



log(p) 



log(p) E^-^n =r*(aJ,6)log(p). (85) 
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Appendix D 
Proof of theorem 3 

From equation (48) d*(r) is equal to the negative SNR exponent of Pi{Ou}- However, from Lemma 4, Pr{Oy} 
is exponentially equal to Pr{r*(a, /?, 8) < r}. Hence, 

Pr{0^}= p- d ' (r) (86) 

= Pr{r*(aJ,8) < r} (87) 
/ (a, j3, 8) da d(3 dS, 

{(a,0,S)€Ou} 

p -E(a,M) dd d p d ^ (88) 



I {(a,/3,8)EOunS} 

where Ou = {(a, $~, 8) : r*(a, f3, 8) < r} and /(.) is the joint pdf of (a, /3, 8). 

Roughly, the above integral is a sum of an infinite number of terms of the form p- E ( a 'P' S \ one for each (a, /3, 8)- 
tuple in Ou- Laplace's method suggest that at the asymptotic SNR only the term having minimum negative exponent 
dominates, i.e., 

d*(r) = min E(a,j3,6), (89) 
{(a,£,<5)esne>t/} 

where S represents the support set of the joint pdf of (a, (3, 8) given in equation (28) and the expression of E(.) is 
given in equation (29). Suppose at a given r, the objective function attains the minimum value for an a G S where 
ai > 1 for one or more i's. Let a = min{[l, 1, • • • , l],a}, where the minimization is component-wise. Clearly, 
a£5 but at this point E(.) has a strictly smaller value. This proves that in the optimal solution, on G [0, 1] for all 
i. The same is true for (3 and 7. This, however, simplifies both the objective function and the constraint set giving 
the optimization problem in equations (51)-(57) and (58)in the statement of Theorem 3. 

Appendix E 
Proof of Theorem 4 

The proof essentially contains two simplifying steps which consecutively simplify the optimization problem of 
Theorem 3. In the first step a transformation of variables yields an equivalent problem having three variables 
independent of the values of m, k and n. Analyzing the domain of definition of the equivalent problem we find 
that in the optimal solution one of the variables is a function of the other two resulting in an optimization problem 
having only two variables. We start with the first step. 

Step 1: The objective function in (58) decreases strictly monotonically as a, is decreased for any i and the rate of 
decrease with is smaller for a larger value of i. The same is true for (3 and 8. Thus, following a similar method 

as in [12], it can be shown that if X^"=i(l — a i) = a ' Sj=i(l — Pi) = Ya=i(^ ~ = s an d ip-iPt") satisfy 
equations (55)-(57), then the optimal choice of (a, /3,<5) that minimizes F(.) is given by (</> a (a), </>/3(&), 4>s(s)), 
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where 

(l - (a - i + 1) + ) + , 1 < i < u, (90) 
, 1<3<P, (91) 



<Pa(a) = 


[&i,a 2 , ■ 


- iT . « 


Mb) = 




■■Jpf ■■ Pi- 


0s (y) = 




■M T : k = 



\-{s-l + iy) + , l<l<q. (92) 



Denoting by T(a, b, s) the following set 



< (a, ft 5) : ^(1 — a i) = a, — ft ) = — (5;) = s, (a, ft <5) satisfy equations (55)-(57) >, 

i=l j = l 1=1 

we have from the above argument that 

min F((aJJj)=F(<t> a (a)>M b )>M«))- ( 93 ) 

{T(a,M)} 

Let us now define the following set of new variables 

0l = { (0 , M; „ + j£-< r , (o + i )< ra ,(„ + s) <„, 0<a<u,0< b < P ,0<s< q }. (94, 
It is clear from the definition of T{a, b, s) that, 

Oi= |J T(o,M)30. (95) 

{(0,6,3)60!} 

Since the minimum of a function over a set is not larger than the minimum of that function over a subset of it, the 
above relation along with equation (93) imply that 

min F ((j> a {a),(j>B(b),(l)s{s)) = min F(a,f3,5) (96) 
{(»,M)eOi) {(aJ,S)edi} 

< min F(aJ,S). (97) 

{(a,p,S)ed} 

Before proceeding further we take note of a few properties of the newly defined variables a, b and s. From the 
definition of tp^s it is clear that if (a, b, s) € 0\, then (</> Q (a), <^(6), </)5(s)) satisfy equations (52) and (55)-(57). 
Suppose for some (i + j) = (m+1), (ft+ft) < 1, then it can be shown that $Z" =1 (1 — ft) + YTj=\ (1 — ft > m > 
which is impossible. Thus (ft + ft) > 1 for all (i + j) > (m + 1). Similarly, it can be shown that (&i + <jj) > 1 
for all [i + I) > (n + 1), which in turn imply that the (4> a (a), 4>p{b), <fis(s)) tuple also satisfies equations (53) and 
(54). That is (a, b, s) G 0\ => (</> a (a), 0/3(6), <f>s(s)) £ 6 which implies that 

min F(a,f3,S)< min F (</> a (a), 4>a{b), <ps(s)) ■ 

{(a,/3,<5)ee>} {(o,&,s)GOi} 

Combining this with equation (96), we get 

min F(aJ,S) = min F (<j> a (a) , (f>p (b) , <f>5 (s)) . (98) 
{(5,ft«)e6} {(a,6,s)eGi} 

Therefore, we have an equivalent optimization problem to that presented in Theorem 3, but with fewer variables, 
i.e., d*(r) can be equivalently written as 

d*(r)= min F (4> a (a), 4>p{b), <j>s(s)) . (99) 

{(a,6,s)eOi} 
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When (a + b) = m or (a + s) = n, the objective function has a property which we state now that will be helpful 
to solve the minimization problem in the next section. 

Claim 1: The fundctions F (cf> a (a), 4>p(m — a), <fis{s)) and F (4> a (a), 4>p(b), <fis(n — a)) are monotonically de- 
creasing with a for a given s and b, respectively, whereas F (<j) a (a), (f>p(m — a), <f>s{n — a)) is monotonically 
decreasing with a. 

Proof: It can be shown using equations (90)-(92) that when (a + b) = m we have 

{&i + 4') = 1, V(i + j) = (m+ 1) and (d, + fc) < 1, V(i +j) < m. 
Using these relations in the expression for F (&, j3, Tj, after some algebra we get 

P Q U :Q 

F Uk,j3,d \ =y^(m + n + fc + 1 - 2z)a l + ^(n + fc+ 1 - 2l)5 t -ku+ ^ (1 — a,- - <5 ; ) + . 

t=i ;=i i,;=i 

The above function is a strictly monotonically increasing function of &i for each 1 < i < u. Each of the dj's 
in turn is a monotonically decreasing function of a which makes the above function a monotonically decreasing 
function of a. 

Similarly, it can be shown that F (<fi a (a), <f>p(b), 4>s(n — a)) is a monotonically decreasing function of a. However, 
when both (a + b) = m and (a + s) = n, then we have 

(&i + 4) = 1, V (i + j) = (m + 1) and (a, +t3 j )<l, V (i + j) < m; 

(d t + <5 ; ) = 1, V (i + 1) = (n + 1) and (d t + <5 ; ) < 1, V (i + l)< n. 

Using this in the expression for F (a, (3, Sj we get 

p 

F {4> a (a), 4>p{m - a), 4> s (n - a)) = F (a, fj, tj =^(m + n + 1 - 2i)ai, 

which by a similar argument as above is a monotonically decreasing function of a. ■ 
Step 2: In this final step, we determine the minimum of F(.) on 0\ and establish the theorem. Note that if 
a, b, s E 0i, then 6 < min{(m — a),p} and s < min{(n — a), g}. Let us denote these maximum values of b and s 
by b m and s m , respectively. Depending on the value of a the set of feasible (6, s) pairs takes on different shapes 
as shown in the following figures. For example, when a € TZi = {a : ^"'^^'^ < s m } the feasible set of (b, s) 
pairs is the region ABCDE shown in Fig. 7(a). 

For any given value of a the following observations will help us solve the problem: 

• The optimal (6, s) pair always lies on the boundary BCDE or BDE, because the objective function is mono- 
tonically decreasing with both b and s. 
m By the same argument the optimal point on the line segment BC and ED are C and D, respectively. 
Now, we argue that the optimal solution does not lie in 0\ P\H C . Note that when a G 1Z C the optimal solution for 
the (6, s) tuple is point D where (b, s) = (b mi s m ). However, when b = b m we have either b = p or b = (n — a). 
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Fig. 7: Sets of feasible (b, s) tuples for different values of a. 



In both of these cases the objective function is monotonically decreasing with a (e.g., see Claim 1). The same is 
true for s. Therefore, it is clear from the definition of 0\ that when (b, s) = (b m , s m ), a should be (also can be) 
increased until the constraint a + u m ?T < r becomes active. In that case however, we have a e 1Z because 



b m +s ri 



(r-a) 



b m (r - a) 



b m + s m (b m - r + a) 

So, the objective function does not attain its minimum value when a E 1Z C and we need to optimize the objective 
function only over the set 0\ H 7Z. In the definition of 1Z the condition in terms of s m and b m can be converted 
to constraints on a as 

pq 



ip + q) 



, r - y/(m- r)(n - r), a* , <„}, r 



(100) 



where 



n 



2 

m + r 



+ q(n-r); 



+ p(m 



(b+s) 



Also by the previous argument the optimal (b, s) tuple lies on the arc CD and satisfies 
the arc CD b can take any value between point, E where b = b m and F, where b = ^ m ^+1) 



range B = 



(s m —r+a) 



,b r , 



Using these facts, we see that the optimal solution is given by 

b{r — a) 



d*{r)= min F [ 4>a(a), 4>p(b), < 

{a£K, b£B} 



(b- r + a) 



(101) 

: (r — a). Further on 
and thus lies in the 



(102) 
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Appendix F 
Proof of Theorem 5 

To prove the theorem we evaluate the minimum value of the objective function in the optimization problem of 
Theorem 4 over different carefully chosen subsets of the feasible set. The choice of these subsets also helps us to 
obtain a closed form expression for the optimal solution in each subset. The union of these sets might not be equal 
to the feasible set. The minimum of these different optimal solutions represent the minimum value of the objective 
function over a subset of the feasible set and hence yields only an an upper bound to the actual minimum. The 
proof is divided into different cases and each case considers a particular subset of the feasible set. 

Case 1 (0i n {a = ?•}): We know the optimal (a, b 7 s) tuple satisfies a + = T. So, a = r implies either 

b = or s = 0. Since we are considering the symmetric case (m = ri), without loss of generality we assume s = 0. 
From the definition of 4>s we get 5i = 1 for 1 < I < q. Since the objective function is monotonically decreasing 
with b for a given a and s to minimize the objective function the maximum possible value of b should be chosen, 
i.e., b = b m = min{(n — r),p}. We need to consider two different cases: 1) (n — r) < p and 2) (n — r) > p. In 
the first case, b = (n — r) and (a + b) = n which along with Claim 1 implies that 

n p n,q 

dii(r) = minF =^(2n + k - 2i + l)<x t + ^{k + n-2j + l)Sj -kn+ ^ (1 - a t - Sj), 
»=i j=i «,j=i 

j+i<n 

n 

( = ) ^(2ri+fc-2i+l)d i = d n , (n+fe) (r), {n-p)<r<n, (103) 

i=l 

where step (a) is obtained by putting 6i = 1, V/ and the last step follows from the definition of <j> a (a). Next we 
consider the case when (n — a) > p and b — p, which in turn imply (3j = 0, 1 < j < p. Putting this in the objective 
function, we get 

n n.p 

di 2 {r) = minF =V](2n + 2k - 2i + l)6n + -kn + Y] (1 - 

i=l i,j=l 

j-\-i<n 

n 

=J2{2n + k-2i + l)a i = rf ni( „ +fc) (r), 0<r<(n-p). (104) 

i=l 

Combining equations (103) and (104) we get the minimum value of the objective function over the chosen subset 

d Ul = min{dii(r),rfi 2 (r)} = d n ^ n+k) (r), < r < n. (105) 

Case 2 (Oi n {b = s = (n — a)}): Putting b = s = [n — a) in the constraint a + (^pjj = r which is always 
active we get a = (2r — n). Now, a G 1Z if and only if 

max jr - |,0*,2r - nj < (2r - n) (n-|)<r. (106) 
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Since (a + b) = n = (a + s), we know from Claim 1 that the objective function gets simplified to 

n 

d U2 (r) = mini? = ^( 2n ~ 2i + (107) 

i=l 

= d n , n (2r-n) (108) 
= d 2n , 2n (2r), {n - |) < r < n [y a = (2r - n)}. (109) 
Case 3 (Oi Pi {a = 0,6 = £>}): We know from Theorem 4 that a 6 7?. if and only if 

max jr- |,o*,2r-n| < =>■ (n-|)<r. (110) 

Further, from the definition of tp a and <f>p we get /3 3 =0, Vj < p and &» = 1, Vi < n. Putting this in the objective 
function we have 



d U3 (r) = minF= n(n + 2k) + YVfc + n - 21 + 1)8 j - 2kn, 



v 



1=1 



2 + J2(n + k-2l + l)[l-(^---l + l) , 0<r<f,. (Ill) 



where the last step follows from the fact that the optimizing (a, b, s) tuple satisfies 4 

Case 4 (i 
and only if 



____ =r 

Case 4 (Oi f\ {b = s = N,l < N < p}): From the constraint a + = r we get a = r - -y. Now, a € 11 if 



r p i AT AT C A 7 A^ 2 1 

max < r , (2r — n), at > < r , — < r < min < n ,n— -, r r ■ (112) 

I 2' V h n ) - 2' 2 ~ ~ \ 2' (2p-N)j ' 

Since b = s = N, from the definition of <?Vs, we have Sj, f3j = 1, Vj > (N + 1) and 5j, f3j = 0, Vj < N. Putting 

this in the objective function we have 

n p N (n—j) 

d U(3+N) (r) =mmF=^r(2n + 2k-2i + l)&i+ 2 + n - 2j + 1) + -2kn + 2]T ^ (1 - 

i=l j=(N+l) J=l i=l 

(n-N) 

= (2n + 2k-2N -2i + l)a t +N 2 , 

i=i 

2 , ( N\ N f N N 2 } 

= N +d (n _jv),(n+2fc-jv) I r - — I , y < r < mm j n - —,n - _ ^ J . (113) 

Case 5 (0\ n {£> = (n — a), s = N,l < N < p}): We further assume k > n, i.e., p = q = n. Using the fact that 
the optimal solution always lies on the arc CD in Fig. 7(a) we have 



Nin-a) 
(A* + n — a) 



Solving the above equation for a we get 



(n + r) / / (n — r) N 



a w = — t, — — u — n — ) +N{n-r), 1<N< P . (115) 



Recall the optimal solution lies on the arc CD in Fig. 7(a). 
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Now, a £ 1Z if and only if 



max • 



^a* n ,(2r-n),r--\ <^-^-M^-^j +N(n~r), (116) 
which implies 

max \(FT^' n " p /- r - n "Y- (117) 

From Claim 1 we get 

n n n (n— i) 

d c/(3 +P+J v, M = minF= ^(2n + fc - 2z + l)dj + £(fc + n - 2j + 1% - kn + £ ^ (1 - ^ - <$,)• 

i=i j=i i=i ;=i 

Since s = N, from the definition of </>$, we have 5^ = 1, Vj > (iV + 1) and =0, Vj < iV. Putting this in the 
above equation we get 

n n (n-i)AJV 

rfc/( 3+P+ «)W=E(2« + fc-2 t + l)d l -7V(n + fc-/V) + ^ ^ 

i=l i=l 1=1 

y (2n + fc - TV - 2i + l)di — — + ^ (n + A: - % + 1), 

i=l " i=(n,-7Y+l) 

= Y, ( 2n + k-N -2i + l)(l-(a N -i+ 1)+) + N 2 , — ™ < r <n--, 



i=i 



where step (a) follows from the fact that Y^i=ii n — i) ^ N 



N{2n-N-1) 
2 



Appendix G 
Proof of Theorem 6 

The optimization problem of Theorem 4 is solved analytically for m = n = 1. Denoting the optimal solution for 
this case by d(i,fe i i)(r - ) and specializing Theorem 4, we get 

d(i,k,i)(r)= min (2& + l)(l-o)+jfe(l-6) + Jfe(l-s)-2jfe, (118) 



where s = and 1Z and B can be computed from equation (100) and Theorem 4 by setting m = n = 1. 

Since the objective function is symmetric with respect to b and s, without loss of generality, we can assume that 
b > s = I^Lr+i) ■ which in turn implies b > 2(r — a). Differentiating with respect to b, we see that the function in 
(118) is a monotonically decreasing function of b for any given a when b > 2(r — a). It is thus minimized when 
b = max£> = max (1 — a) = (1 — a)- Putting this in the above equation we get 



dn k i)(r) =mm (2fc + 1)(1 - a) + ka + k ( 1 - (1 " )(r , &) ) 



-2k 



--mm (k + 1)(1 - r) + (r - a) (l - ) 

{a} V I 1 ~ r ) 

S v 

T(r-a) 
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k 



(1-r), if \ < r < 1. 



where < (r — a) < (r A (1 — r)). Note that T(r — a) in the above equation is a concave function of (r — a) of 
the form depicted in Fig. 8 (in this figure, k = 3 and r = .35) which intersects the x-axis at (r — a) = Ri 
Thus the objective function is minimized for the following values of (r — a) 



,0, if(r- a) <^; if 0<.<^ ; 

(rA(l-r)), if (r - a) > ^— ^ 




Fig. 8: Plot of T(r - a) vs. (r - a). 

Putting these values for optimal (7- — a) in equation (119), we get the DMT of the (1, k, 1) HD-RC as in (66), 
thus proving the theorem. 

Appendix H 
Proof of Theorem 7 

To prove this result we solve the optimization problem in Theorem 3 analytically for the case m = n and k = 1. 
Let the optimal solution be denoted by d* n l n \{r). In what follows, the computation of d* n 1 n \(r) is carried out in 
two steps. First, we compute an upper bound, dj n 1 * (r) > d*^ n 1 n ^ (r), by computing the minimum of the objective 
function over a carefully chosen subset of the feasible set of the optimization problem in Theorem 3. Then we 
compute the DMT of the static (n, 1, n) RC, denoted as d^^ n \(r), which serves as a lower bound (since dynamic 
protocols include static protocols as a special case). 
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A. The Upper bound 

Putting m = n and k = 1 in the optimization problem of Theorem 3 we get 

n 

d*r„ i „\(r) — min > (2n — 2i + 3)aj+n/?i + nSi — 2n 

i—i 

n-l 

+ ^{(l-/?i-a J ) + + (l-<5 1 -ai)+}. (120) 

i=i 

where 

S = \(a, ft, Si) : V(l - on) + (1 ~ ^ , < r; < ai < a 2 <,-•-,< a„ < 1; 

< ft < 1, < 8i< 1; (ft + a„) > 1, (ft + a„) > l}. (121) 

Now, suppose we reduce the size of S by putting the additional constraints Si = 1 and (a n + ft) = 1 and denote 
the resulting feasible set by S, i.e., 

n 

^{(c^ftA) :^(l-ai)<r; < a x < a 2 <,•■•,< a„ < 1; < ft < 1, < <*i < 1; ft +a„ = l}. 

The objective function minimized over 5 would clearly be an upper bound on dJ 1 n \(r). Denoting the minimum 
over S as 1 n % (r), we have 

n n— 1 

ClnlW= min Y{2n-2i + 3)a l +n/3 1 -n + y(l -ft - a,) + , (122) 

and 

d (n,l,n)( r ) ^ ^fn.l.n)^)- 

Further, since (a n + ft) = 1 we have (an + ft) < 1 for 1 < i < (n — 1) which in turn implies that 

n— 1 n — 1 

5^(1 -ft -<*)+ = 5^(1 -ft -a,). 

i=i ?:=i 

Substituting this in equation (122) and eliminating ft from the objective function we get 

n 

d u (nX n){r) = m_in]T(2n - 2i + 2)a, (123) 
*- a * i=i 

subject to 

n 

— < r and < a<i < a% <,•••,< a n < 1. 

i=i 

This is the optimization problem that arises in solving for the (n + 1, ?i) point-to-point MIMO channel [12]. Hence, 
we have 

rf(n,i,„)W < ^n.i^W = <+!)>), < r < n. (124) 
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B. Lower bound 

The DMT of the static (n, l,n) HD-RC, denoted as d s ^[ n \(r), and shown in Theorem 8 proved in Appendix I 
to be equal to that of the (n + l,n) point-to-point MIMO channel, is a lower bound to dJ 1 „\(r). 

d* {n ,i,n)(r) > <U)( r ) = < r < n. (125) 

Combining this result with the upper bound in (124), we have Theorem 7. 

Appendix I 
Proof of Theorem 8 (Contd.) 

It is proved by mathematical induction that 

= < P +1) ,„(r), 0<r<n. (126) 

For n = 1, the result is given in [28]. Assuming that (126) is true for n = (N — 1), we prove that it is also true 
for n = N. Now, from the objective function in equation (71) and the constraint (72) it is clear that for < r < 1, 
the objective function decays at the fastest rate if cti is decreased with increasing r. 5 Therefore, for < r < 1, the 
optimal solution is a\ = (1 — r) + , a, = 1 for 2 < i < N and ft = 1. Putting this in equation (71) we get 

d$W»0 =2tf(l - r) + ^_ 1)i7V (0), < r < 1; 

- r - L (127) 
On the other hand, since ai = (1 — ?') + for r > 1, substituting ai = in equation (71) we see that the 
optimization problem can be written as 

N N-l 

d?N!i,N)(r) = min V(2iV - 2i + 2)a t + (N - - (JV - 1) + ^ (1 - A - 

(JV-l) JV-l 

= min V (2(iV - 1) - 2i+ 2)oj + (JV - 1)A - (JV - 1) + V (1 - ft - 
subject to the following constraints 

(JV-l) 

y (1 - a*) + -(1 - ft) < (r - 1); < di < a 2 <, • • • < «(jv-i) < 1; < ft < 1 and (ft + a {N _ x) ) > 1. 

Evidently, the solution of (128) at a given r is the DMT of the static (N - 1, 1, N - 1) HD-RC evaluated at r - 1, 
which by the induction assumption is 

dfN\,N){r) = dffiUxN-ifr ~ I), < (r - 1) < (N - 1) 
= dxN-i),N( r - 0<(r-l)<(JV-l) 
= ^D.jvW, 1 < r < iV. 

5 For < r < i, the objective function decays at the same rate if j3\ is decreased, but then for i < r < 1, the objective function decreases 
at a strictly smaller rate than if ct\ was decreased from the beginning. 
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Combining this with equation (127) we get d^y* jv)( r ) = ^fjv+i) iv( r )> < r < TV Hence, by induction we have 
for all iieN, 

<:tn)W = < P +1) ,„(r), 0<r<n. 
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